Using Audio Based Disambiguation for Improving Handwritten Mathematical Content Recognition in Classroom Videos
نویسندگان
چکیده
We consider the problem of recognizing handwritten mathematical content in classroom videos that capture the content written on the whiteboard and the content spoken by the instructor. While the problem of recognizing handwritten textual content from videos has been studied before, recognition of handwritten mathematical content and the use of audio content from classroom videos to assist in recognition, however, presents us with a new set of challenges. In this paper, we outline such challenges, and present the description of an end to end system that makes use of both video and audio based recognition components to improve the accuracy of handwritten mathematical content recognition. We have implemented the system using an open source implementation of a text recognizer and a commercially available phonetic word spotter. Preliminary results reported in this paper demonstrate the viability of our approach.
منابع مشابه
Audio-video based character recognition for handwritten mathematical content in classroom videos
Recognizing handwritten mathematical content is a challenging problem, and more so when such content appears in classroom videos. However, given the fact that in such videos the handwritten text and the accompanying audio refer to the same content, a combination of a video and an audio based recognizer has the potential to significantly improve the content recognition accuracy. In this paper, u...
متن کاملUse of the Shearlet Transform and Transfer Learning in Offline Handwritten Signature Verification and Recognition
Despite the growing growth of technology, handwritten signature has been selected as the first option between biometrics by users. In this paper, a new methodology for offline handwritten signature verification and recognition based on the Shearlet transform and transfer learning is proposed. Since, a large percentage of handwritten signatures are composed of curves and the performance of a sig...
متن کاملPersian Handwritten Digit Recognition Using Particle Swarm Probabilistic Neural Network
Handwritten digit recognition can be categorized as a classification problem. Probabilistic Neural Network (PNN) is one of the most effective and useful classifiers, which works based on Bayesian rule. In this paper, in order to recognize Persian (Farsi) handwritten digit recognition, a combination of intelligent clustering method and PNN has been utilized. Hoda database, which includes 80000 P...
متن کاملSemantic Multi-modal Analysis, Structuring, and Visualization for Candid Personal Interaction Videos
Videos are rich in multimedia content and semantics, which should be used by video browsers to better present the audio-visual information to the viewer. Ubiquitous video players allow for content to be scanned linearly, rarely providing summaries or methods for searching. Through analysis of audio and video tracks, it is possible to extract text transcripts from audio, displayed text from vide...
متن کاملNeural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten
Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...
متن کامل